254 research outputs found

    PART: Pre-trained Authorship Representation Transformer

    Full text link
    Authors writing documents imprint identifying information within their texts: vocabulary, registry, punctuation, misspellings, or even emoji usage. Finding these details is very relevant to profile authors, relating back to their gender, occupation, age, and so on. But most importantly, repeating writing patterns can help attributing authorship to a text. Previous works use hand-crafted features or classification tasks to train their authorship models, leading to poor performance on out-of-domain authors. A better approach to this task is to learn stylometric representations, but this by itself is an open research challenge. In this paper, we propose PART: a contrastively trained model fit to learn \textbf{authorship embeddings} instead of semantics. By comparing pairs of documents written by the same author, we are able to determine the proprietary of a text by evaluating the cosine similarity of the evaluated documents, a zero-shot generalization to authorship identification. To this end, a pre-trained Transformer with an LSTM head is trained with the contrastive training method. We train our model on a diverse set of authors, from literature, anonymous blog posters and corporate emails; a heterogeneous set with distinct and identifiable writing styles. The model is evaluated on these datasets, achieving zero-shot 72.39\% and 86.73\% accuracy and top-5 accuracy respectively on the joint evaluation dataset when determining authorship from a set of 250 different authors. We qualitatively assess the representations with different data visualizations on the available datasets, profiling features such as book types, gender, age, or occupation of the author

    An online failure prediction system for private IaaS platforms

    Get PDF
    The size and complexity of cloud environments make them prone to failures. The traditional approach to achieve a high dependability for these systems relies on constant monitoring. However, this method is purely reactive. A more proactive approach is provided by online failure prediction (OFP) techniques. In this paper, we describe a OFP system for private IaaS platforms, currently under development, that combines di_erent types of data input, including monitoring information, event logs, and failure data. In addition, this system operates at both the physical and virtual planes of the cloud, taking into account the relationships between nodes and failure propagation mechanisms that are unique to cloud environments

    Tendencias en el número de médicos titulados anualmente en el Perú, 2007-2016

    Get PDF
    Objetivo: Describir la tendencia en el número de médicos que se titularon durante el periodo 2007-2016 en Perú, en forma general y en subgrupos de acuerdo a las características ligadas a la universidad en la que cursaron los estudios de pregrado. Materiales y métodos: Estudio descriptivo. Se obtuvo el listado de todos los médicos colegiados entre 2007-2016 por medio de la página web del Colegio Médico del Perú; mientras que la fecha de titulación y universidad de procedencia provino de la página web de la Superintendencia Nacional de Educación Superior Universitaria (SUNEDU). Para evaluar las tendencias, se utilizó la prueba de correlación de Spearman. Resultados: En el periodo de estudio se colegiaron 27 611 médicos a nivel nacional, con una tendencia anual creciente en la cantidad de médicos titulados (p<0,001). Entre los egresados de universidades peruanas, se encontró un incremento del número de médicos que estudiaron en universidades de Lima (p<0,001) y de la región costa (p<0,001). Adicionalmente, se evidenció un incremento en la cantidad de titulados provenientes de universidades privadas de Lima (p<0,001) y de provincias (p<0,001). Conclusiones: El número de médicos titulados aumenta anualmente, con predominio de aquellos provenientes de universidades de Lima, la costa y universidades privadas. Se evidencia la necesidad urgente de políticas que regulen este crecimiento, con la finalidad de evitar problemas de calidad educativa y empleabilidad

    Abstracts from the Food Allergy and Anaphylaxis Meeting 2016

    Get PDF

    Mechanisms and Regulation of Mitotic Recombination in Saccharomyces cerevisiae

    No full text

    Measurement of prompt D+D^+ and Ds+D^+_{s} production in pPbp\mathrm{Pb} collisions at sNN=5.02\sqrt {s_{\mathrm{NN}}}=5.02\,TeV

    No full text
    International audienceThe production of prompt D+D^+ and Ds+D^+_{s} mesons is studied in proton-lead collisions at a centre-of-mass energy of sNN=5.02\sqrt {s_{\mathrm{NN}}}=5.02\,TeV. The data sample corresponding to an integrated luminosity of (1.58±0.02)nb1(1.58\pm0.02)\mathrm{nb}^{-1} is collected by the LHCb experiment at the LHC. The differential production cross-sections are measured using D+D^+ and Ds+D^+_{s} candidates with transverse momentum in the range of 0<pT<14GeV/c0<p_{\mathrm{T}} <14\,\mathrm{GeV}/c and rapidities in the ranges of 1.5<y<4.01.5<y^*<4.0 and 5.0<y<2.5-5.0<y^*<-2.5 in the nucleon-nucleon centre-of-mass system. For both particles, the nuclear modification factor and the forward-backward production ratio are determined. These results are compared with theoretical models that include initial-state nuclear effects. In addition, measurements of the cross-section ratios between D+D^+, Ds+D^+_{s} and D0D^0 mesons are presented, providing a baseline for studying the charm hadronization in lead-lead collisions at LHC energies

    Measurement of prompt D+D^+ and Ds+D^+_{s} production in pPbp\mathrm{Pb} collisions at sNN=5.02\sqrt {s_{\mathrm{NN}}}=5.02\,TeV

    No full text
    International audienceThe production of prompt D+D^+ and Ds+D^+_{s} mesons is studied in proton-lead collisions at a centre-of-mass energy of sNN=5.02\sqrt {s_{\mathrm{NN}}}=5.02\,TeV. The data sample corresponding to an integrated luminosity of (1.58±0.02)nb1(1.58\pm0.02)\mathrm{nb}^{-1} is collected by the LHCb experiment at the LHC. The differential production cross-sections are measured using D+D^+ and Ds+D^+_{s} candidates with transverse momentum in the range of 0<pT<14GeV/c0<p_{\mathrm{T}} <14\,\mathrm{GeV}/c and rapidities in the ranges of 1.5<y<4.01.5<y^*<4.0 and 5.0<y<2.5-5.0<y^*<-2.5 in the nucleon-nucleon centre-of-mass system. For both particles, the nuclear modification factor and the forward-backward production ratio are determined. These results are compared with theoretical models that include initial-state nuclear effects. In addition, measurements of the cross-section ratios between D+D^+, Ds+D^+_{s} and D0D^0 mesons are presented, providing a baseline for studying the charm hadronization in lead-lead collisions at LHC energies
    corecore